Towards modeling user behavior in interactions mediated through an automated bidirectional speech translation system

نویسندگان

  • JongHo Shin
  • Panayiotis G. Georgiou
  • Shrikanth S. Narayanan
چکیده

This paper addresses modeling user behavior in interactions between two people who do not share a common spoken language and communicate with the aid of an automated bidirectional speech translation system. These interaction settings are complex. The translation machine attempts to bridge the language gap by mediating the verbal communication, noting however that the technology may not be always perfect. In a step toward understanding user behavior in this mediated communication scenario, usability data from doctor–patient dialogs involving a two way English–Persian speech translation system are analyzed. We specifically consider user behavior in light of potential uncertainty in the communication between the interlocutors. We analyze the Retry (Repeat and Rephrase) versus Accept behaviors in the mediated verbal channel and as a result identify three user types – Accommodating, Normal and Picky, and propose a dynamic Bayesian network model of user behavior. To validate the model, we performed offline and online experiments. The experimental results using offline data show that correct user type is clearly identified as a user keeps his/her consistent behavior in a given interaction condition. In the online experiment, agent feedback was presented to users according to the user types. We show high user satisfaction and interaction efficiency in the analysis of user interview, video data, questionnaire and log data. 2009 Elsevier Ltd. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An English-persian Automatic Speech Translator: Recent Developments in Domain Portability and User Modeling

In this paper we describe the English-Persian speech to speech translation device, and provide insight on the lessons learned during the first phase of development of this system. We start by giving an overview of the underlying components of the device: the front end ASR, the machine translation system and the speech generation system. Challenges such as the sparseness of available spoken lang...

متن کامل

Towards modeling user behavior in human-machine interactions: Effect of Errors and Emotions

Data-driven approaches to spoken dialog strategy design rely on a sound understanding and modeling of user behavior in their interaction with machines. The spoken language usermachine communication channel is inherently noisy; noise in the channel may be due to errors in machine speech recognition, language understanding or other machine/user communication uncertainty and errors. Hence, annotat...

متن کامل

IraqComm and FlexTrans: A Speech Translation System and Flexible Framework

SRI International's IraqComm system performs bidirectional speech-to-speech machine translation between English and Iraqi Arabic in the domains of force protection, municipal and medical services, and training. The system was developed primarily under DARPA's TRANSTAC Program and includes: speech recognition components using SRI's Dynaspeak engine; MT components using SRI's Gemini and SRInte...

متن کامل

Modeling Emotional State and Personality for Conversational Agents

We describe an architecture for constructing a character-based agent based on speech and graphical interactions. The architecture uses models of emotions and personality encoded as Bayesian networks to 1) diagnose the emotions and personality of the user, and 2) generate appropriate behavior by an automated agent in response to the user’s input. Classes of interaction that are interpreted and/o...

متن کامل

Interactive Speech Translation in the Diplomat Project 2 Multi-engine Machine Translation

The DIPLOMAT rapid-deployment speech translation system is intended to allow na ve users to communicate across a language barrier, without strong domain restrictions, despite the error-prone nature of current speech and translation technologies. Achieving this ambitious goal depends in large part on allowing the users to interactively correct recognition and translation errors. We brieey presen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Speech & Language

دوره 24  شماره 

صفحات  -

تاریخ انتشار 2010